Browsing Document Collections: Automatically Organizing Digital Libraries and Hypermedia using the Gray Code

نویسنده

  • Robert M. Losee
چکیده

Relevance and economic feedback may be used to produce an ordering of documents that supports browsing in hypermedia and digital libraries. Document classification based on the Gray code provides paths through the entire collection, each path traversing each node in the set of documents exactly once. Systems organizing documents based on weighted and unweighted Gray codes are examined. Relevance feedback is used to conceptually organize the collection for an individual to browse, based on that individual’s interests and information needs, as reflected by their relevance judgements and user supplied economic preferences. We apply Bayesian learning theory to estimating the characteristics of documents of interest to the user and supply an analytic model of browsing performance, based on minimizing the Expected Browsing Distance (EBD). Economic feedback may be used to change the ordering of documents to benefit the user. Using these techniques, a hypermedia or digital library may order any and all available documents, not just those examined, based on the information provided by the searcher or people with similar interests.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Towards Automatic Content-based Organization of Multilingual Digital Libraries: an English, French, and German View of the Russian Information Agency Novosti News

In this paper we present the application of the SOMLib digital library system to a multilingual document corpus from the Russian Information Agency Novosti. News articles in Russian, English, and German are automatically organized into separate topic hierarchies using a novel unsupervised neural network, namely the Growing Hierarchical Self-Organizing Map. Furthermore, machine translation is us...

متن کامل

Map-based Interfaces for Information Management in Large Text Collections

The Self-Organising Map (SOM) has been proposed as an alternative interface for exploring Digital Libraries or other big document collections, in addition to conventional search and browsing. With advanced visualisations assisting the user in understanding the contents of the map and its structure, as well as advanced interaction modes as zooming, panning and area selection, the SOM becomes a f...

متن کامل

Browsing Digital Libraries with the Aid of Self-Organizing Maps

-Powerful methods for exploring and searching collections of free-form textual documents are needed to control the flood of digital information emerging from various sources. In this article we present a method, WEBSOM, for automatic organization of document collections based on full-text analysis using the Self-Organizing Map. The document collection is ordered on the map in such a way that si...

متن کامل

Progressive Discovery of Document Content

As the World-Wide Web, digital libraries and similar systems become ubiquitous, increasingly effective techniques are developed to help readers locate useful documents. Information retrieval techniques for indexing and querying documents provide for accurate matching of documents against user queries. Graphical and textual query interfaces allow users to more easily and effectively specify thei...

متن کامل

Phrasier: An Interactive System for Linking and Browsing Within Document Collections Using Keyphrases

When documents are collected together from diverse sources they are unlikely to contain useful hypertext links to support browsing amongst them. Manual, or semi-automated link creation is often infeasibly time-consuming for large document collections. We present Phrasier, an interactive system which automatically introduces links to related material into documents as the user browses and querie...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Inf. Process. Manage.

دوره 33  شماره 

صفحات  -

تاریخ انتشار 1997